The structure of reversible computation determines the self-duality of quantum theory 
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Predictions for measurement outcomes in physical theories are usually computed by combining two distinct 
notions: a state, describing the physical system, and an obsen'able, describing the measurement which is per- 
formed. In quantum theory, however, both notions are in some sense identical: outcome probabilities are given 
by the overlap between two state vectors ~ quantum theory is self-dual. In this paper, we show that this notion of 
self-duality can be understood from a dynamical point of view. We prove that self-duality follows from a com- 
putational primitive called bit symmetry: every logical bit can be mapped to any other logical bit by a reversible 
transformation. Specifically, we consider probabilistic theories more general than quantum theory, and prove 
that every bit-symmetric theory must necessarily be self-dual. We also show that bit symmetry yields stronger 
restrictions on the set of allowed bipartite states than the no-signalling principle alone, suggesting reversible 
time evolution as a possible reason for limitations of non-locality. 



A central idea of every statistical physical theory is the dis- 
tinction between states and observables. If we perform a mea- 
surement on a physical system, the state describes the prepa- 
ration of the system, while the observable corresponds to our 
choice of measurement. Combining the two, we obtain expec- 
tation values of measurement outcomes. 

In principle, states and observables are fundamentally dis- 
tinct objects. However, in quantum theory, they turn out to be 
identical: transition probabilities between two states \(p) and 
IV') are given by the overlap 

Prob(V ^ ^) = Mij)\^ = Tv{\^){ip\ \^){^\) . (1) 

More generally, the probability of obtaining an outcome de- 
scribed by the projector or effect operator P, measured on a 
(mixed) quantum state p, is given by Tr(pP). It is remark- 
able that state p and observable P are described by the same 
mathematical objects: up to normalization, they are both ar- 
bitrary positive semidefinite operators lITSl . This property of 
self-duality, which is most obvious in the special case ([TJ, lies 
at the very heart of quantum theory, and can be understood as 
the main ingredient in the Born rule. 

In this paper we show that this remarkable property can 
be understood in information-theoretic terms: self-duality is 
a consequence of a certain computational primitive that we 
call bit symmetry. Every theory that satisfies bit symmetry 
- which we argue is necessary to allow for powerful compu- 
tation - must be self-dual. We also prove that bit symmetry 
restricts the set of possible bipartite states in all theories with 
non-locality, including quantum theory. 

General probabilistic theories. Almost any conceivable sta- 
tistical physical theory, including quantum theory and clas- 
sical probability theory as special cases, can be described 
within the framework of general probabilistic theories ||I}{5]. 
The main physical notions are preparations, transformations, 
and measurements. Any physical system is described by a 
finite-dimensional real vector space A. The possible prepara- 
tion procedures are represented by a set of normalized states 
C A (in quantum theory, A is the set of self-adjoint op- 
erators on some Hilbert space, while Q,a is the set of density 
matrices). If we have two states (p,uj E f2^, we can think of 



a device which prepares either state tp with probability p, or uj 
with probability 1 — p, yielding the state ptp + (1 — p)uj |4). 
Therefore, state spaces are convex. Similarly as in quantum 
theory, states will be called mixed if they can be written as a 
convex combination of this form for some < p < 1 and 
ip ^ UJ, and otherwise pure. We also assume that state spaces 
are compact, which implies that every state can be written as 
a finite convex combination of pure states [2J. 




FIG. 1: Two state spaces: one is a square, the other a pentagon. 
Shown are pairs of perfectly distinguishable states u, if and a;', ip' . 
For the square, there is no symmetry which maps the pair uj,ip to 
the pair uj' ,f': the square state space is not bit-symmetric. For the 
pentagon, the pair a;, (^3 is mapped to uj' , ip' by a reflection across 
a symmetry axis. All pairs of perfectly distinguishable pure states 
can be mapped to each other - the pentagon is bit-symmetric. The 
dotted lines denote the level sets of a measurement effect E which 
distinguishes uj and ip (and, accidentally, also uj' and p'). That is, 
the line containing uj is {x : E{x) = 1}, and the line containing p 
is {a:: : E{x) = 0}. For the square state space, there are two types 
of inequivalent logical bits: lines generated by adjacent pure states 
like UJ, ip, and the square itself which is generated by diametral states 
like oj' ,p' . For the pentagon - and any other bit-symmetric theory 
- all logical bits generated by pairs of perfectly distinguishable pure 
states are isometric (in this case, all pairs generate the full pentagon). 

It is important for calculations to include unnormalized 
states in the framework, that is, elements of the form A • uj 
for A > and uj e Ha- The set of all these elements is called 
A+. It is closed with respect to sums and convex combina- 
tions - in convex geometry, sets of this kind are called cones. 
We assume that A^ spans the whole space A. In quantum 
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theory, A+ is the set of positive semidefinite matrices. 

In order to describe observables, consider any measurement 
with several possible outcomes that we perform on a state 
cj. Denote by E{oj) the probability of obtaining one partic- 
ular outcome. This must be a number between and 1, and 
it must respect probabilistic mixtures: E {pip + (1 — p)uj) = 
pE{(f) + (1 — p)E{uj); that is, E must be linear |T|. Linear 
maps i? : A — > R (i.e. functionals) which are non-negative on 
all of will be called effects, and the set of all effects is de- 
noted A*^_. It is easy to see that A*^_ is again a cone - in convex 
geometry terms, it is called the dual cone of A^ 161. The nor- 
malization of states is determined by the unit u, a particular 
element of A*^ which assigns the value one to all normalized 
states: u{uj) — 1 for all uj E Ha (in quantum theory, we have 
u{p) = Tr(p)). An effect E e A*^_ is called a proper effect if 
< E{oj) < 1 for all states uj e VIa- 

In quantum theory, all effects can be written as maps p i-> 
Tr{pP), where P > is a positive semidefinite matrix; it 
is proper iff P < 1. Identifying this effect with the matrix 
P, we see that can be identified with the set of positive 
semidefinite matrices, such that A+ ~ A*^. This is the notion 
of self-duality which will be studied in more detail in the next 
section. At this point, however, it is important to note that 
A^ and A*^ can be very different in general. As an example, 
consider a state space 

Ha := { {xi,X2, l)"^ G | - 1 < Xi,X2 < 1} . (2) 

This state space looks like a square. It contains four pure 
states, for example ui = (1,1,1)^ and (p = (—1,-1,1)-^, 
and has unit u{x) := x^. Using the standard inner product on 
A — Wi^ and the pure state uj, we can define a linear map E^^ 
by E^{x) :— {uj, x) — xi +0:2 + 2^3. Even though cj is a valid 
state, Ei^ is not a valid effect: for example E^{if) = —1^0. 
For the square state space, A+ and A*i^ cannot be identified 
in this way - they will be different no matter which inner 
product we use |7|. 

Self-duality. Building on the previous examples, we define 
a system A to be self-dual |fT6 l iff there is some inner product 
(•, •) OTi A such that the set of effects (represented as vectors 
in A) agrees with the set of states, A*j^ = A+\ that is, 

A\^{uj^ {w, if) \ipEA+}. 

Quantum theory is self-dual. To see this, recall that for an n- 
level quantum system, the real vector space A is the set of self- 
adjoint n X n-matrices. Consider the Hilbert Schmidt inner 
product on A, given by (X, Y) Tt:{XY). As we have seen 
above, under this inner product, we can identify A+ and A*^: 
both are the set of positive semidefinite matrices. 

As another example, it can be shown that the square 
state space Q is not self-dual Q, as already indicated. 
More generally, regular polygons with n vertices are self- 
dual if and only if n is odd. This will become important below. 

Bit symmetry. In addition to preparations (states) and mea- 
surements (effects), physical theories also contain a notion of 



transformations. Transformations describe on the one hand 
possible physical time evolution, and on the other hand pos- 
sible computations that can be accomplished in the respective 
theory. In this paper, we will only consider reversible trans- 
formations. This is motivated by the fact that time evolution 
in our universe seems to be fundamentally reversible, and also 
by the conceptual analogy to the reversible circuit model in 
quantum computation. 

Transformations must be linear (since they must respect 
probabilistic mixtures 11]), preserve the normalization, and 
map states to states. For reversible transformations T, this 
must also be true for their inverses. Consequently, they must 
be symmetries of the state space: T{Q,a) — ^a- There- 
fore, the set of reversible transformations on a system A is 
a group Qa, which is a subgroup of all symmetries. We as- 
sume that Qa is compact, which may be motivated on physical 
grounds |8 1. In quantum theory, Qa is the group of unitaries. 

We are interested in a particular type of symmetry which 
connects all logical bits. To this end, we call two states <y3, w G 
Ha perfectly distinguishable if there is a proper effect E such 
that E{lp) — and E{uj) = 1 - that is, if there is a conceivable 
measurement device that distinguishes Lp and lo perfectly in a 
single run. Since all states -0 have < ^ 1^ the states 

tf and u) must lie on opposite sides of state space: the set of 
vectors x E A with E{x) ~ 1 resp. E{x) = are two parallel 
supporting hyperplanes, touching the state space in p and lo, 
with the full state space lying in between, as sketched in Fig.[T] 

Every pair of pure and perfectly distinguishable states p 
and uj generate a logical bit: in terms of convex geometry, 
this is the face generated by </? and w, that is, the smallest 
face 1 17 1 of Ha containing both ip and w. In quantum theory, 
two pure states \lp) {(f \ and jw) (w| are perfectly distinguishable 
if and only if ((^|cl>) = 0. The logical bit that they generate 
is not simply the line segment making up their convex hull, 
but contains all pure states of the form a\Lp) + /3|a;) and their 
convex mixtures - that is, a full Bloch ball ifTSl . 

Now we are ready to define our main notion: a system A 
is called bit-symmetric, if one of the two following equivalent 
conditions holds: 

• If (p, w are perfectly distinguishable pure states, and so 
are (^', cj', then there is a reversible transformation T E 
Qa such that Tip = p' and Tuj — oj' . 

• Every logical bit can be mapped to every other logical 
bit by some reversible transformation. 

Quantum theory is obviously bit-symmetric: every pair of 
orthogonal pure states can be mapped to every other by some 
unitary. It is even more symmetric than this: analogous state- 
ments hold for triples, quadruples, etc., of orthogonal pure 
states. As a less trivial example, consider state spaces that are 
regular polygons with n vertices. It turns out that these state 
spaces are bit-symmetric if and only if n is odd. In Fig.[T] this 
is illustrated for n = 4 and n = 5, i.e. for the square and the 
pentagon. 

Classical probability theory is bit-symmetric as well: the 
n-outcome state space is the set of probability distributions 
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(pi, . . . ,Pn), TliiPi = Pi ^ 0. Geometrically, this con- 
vex set is a simplex, and the pure states are of the form 
(0, . . . , 0, 1, 0, . . . , 0) (full weight on one outcome). The re- 
versible transformations are the permutations of the n entries, 
which can map every pair of pure states to every other In fact, 
these "transpositions" generate the full group of permutations. 

As the last example illustrates, bit symmetry is an im- 
portant and basic computational primitive. In the context 
of quantum computation, it implies that any "entangled" 
logical bit that appears in a computation on many qubits 
can in principle be mapped to the first qubit (awaiting a 
final measurement) without destroying coherence. In general 
theories, bit symmetry means that yes-no-questions which can 
be answered perfectly by (irreversible) measurements may in 
principle also be asked "coherently" and be part of a larger 
reversible computation. In physical terms, it means that the 
state of any natural two-level system can be transferred to any 
other two-level system by a suitable reversible interaction. 
One may argue that the time evolution of the universe would 
be severely constrained if this property did not hold. 

Main Result. Now we prove our main theorem: 

Theorem 1 If a state space is bit-symmetric, then it is also 
self-dual. 

Moreover, the corresponding inner product can be chosen 
to be non-negative on all states, invariant under all reversible 
transformations, and to satisfy {uj,uj) = 1 for all pure states 
Lo and (uj^if) =Oifuj and ip are perfectly distinguishable. 

Remark. In quantum theory, (•, •) is the Hilbert-Schmidt inner 
product between self-adjoint matrices: {X,Y) = Ti{XY); 
invariance means that (UXW, UYW) = (X, Y) for all uni- 
taries U. In all bit-symmetric theories, if one of lu and (p is 
pure, then {uj,(p) — implies that uj and (p are perfectly dis- 
tinguishable. However, we were not able to prove that the 
same holds true in general if both are mixed. 

Proof. If w e Ha is any pure state, then there is always 
another pure state ip that is perfectly distinguishable from lu 
(unless the state space contains only a single point). Thus, 
bit symmetry implies transitivity: to every pair of pure states 
io,ip, there is a reversible transformation T E Qa such that 
T(jj = ip. This allows us to define a maximally mixed state 



as fi 



— /tgs^ where uj g Ha is any pure state. 



Due to transitivity, fi^ does not depend on the choice of cj. To 
every state uj, define its Bloch vector uj := uj — . Then we 
can decompose the space A into A = A ffi R • n^, where A is 
the set of all points x £ A with u{x) — 0, with u the unit on 
A. If cj is a state, then its Bloch vector uj is an element of A. 

Since reversible transformations preserve normalization, 
they leave the subspace A invariant. According to group rep- 
resentation theory |9J, there is an inner product (•, •) on A 
such that {Tx, Ty) = (x, y) for all T e Ga and x,y e A. We 
may scale this product by an arbitrary positive factor such that 
{uj, uj) — 1 for all pure states uj (they all have the same inner 
product due to transitivity). 



Define c := min^^^pgn^ (w, i^) < = to be the 

minimal inner product between the Bloch vectors of any two 
states. Our next step is to prove the following statements: 

(i) For all uj and (p, we have c < {uj, ip) < 1, where c < 0. 

(ii) If UJ is pure and (p is arbitrary and {uj,(f) = c, then uj 
and ip are perfectly distinguishable. 

(iii) If UJ and ip are arbitrary perfectly distinguishable states, 
then (w, <p) = c. 

To this end, define a linear map : A ^ Wi for every pure 
state UJ by linear extension of 

EU^) -.^ ^^^fl^ {^enA). 

1 — c 

Since c < 0, this is well-defined, and since {uJ,(p) > c, we 
have Ei^{ip) > for all ip G ^Ia- Due to convexity of the 
norm \\uj\\ = ^ {uj, uj), all mixed states uj satisfy < 1, 
with equality for the pure states. Thus, the Cauchy-Schwarz 
inequality implies {uj, (p) < \\uj\\ ■ \\ip\\ < 1, hence E^{ip) < 1 
for all ip e flA- In other words, for every pure state ui, the map 
E^ is a proper effect. Now suppose that uj S ^Ia is pure and 
ip e flA is arbitrary, and {uj, ip) — c. Then E^{ip) — and 
E^ {uj) — 1, hence if and uj are perfectly distinguishable. This 
proves (ii). Moreover, if c = 0, we would have {uj, p."^) = = 
c, and so uj and /i'^ would be perfectly distinguishable, which 
is impossible. Hence c < 0, proving (i). 

Choose uj,ip E riA such that {uj,ip) = c. We can de- 
compose u! and ip into pure states uji and (pj: uj — J^i oi-i^i, 
ip = J2j Pj^] with a„ (3j > 0. Since c = J^ij ctiPjii^i, 'Pj), 
and c is the minimal possible value, every addend must have 
this value due to convexity, so {uji,(pj) = c for all 
Thus uji and (pj are pure and perfectly distinguishable. Fix 
some i,]. Now if uj' and ip' are another pair of pure and 
perfectly distinguishable states, there is a reversible trans- 
formation T such that Tuji — uj' and Tipj — if' , hence 
{il}' ,(f') = {Tilji,Tifj) = {uji,ifj) — c. That is, every pair 
of perfectly distinguishable pure states has inner product c 
between its Bloch vectors. Now suppose that uj and ip are 
arbitrary perfectly distinguishable states. Decomposing them 
as above, it follows that every uji is perfectly distinguishable 
from every ipj, hence {uj, ip) — J^ij o:i(3j{il!i, ifj) = c. This 
proves statement (iii). 

Let E be any effect such that IRq • £' is an exposed ray of 
A*,. That is, there is some x E A with the following property: 



FeA*F{x) 







F = XE for some A > 0. (3) 



The point x defines a supporting hyperplane of A'^, touching 
it in the ray generated by E. Thus, either F{x) > for all 
F E A*^, or F{x) < for all F E A*^. In the last case, we 
can redefine x {—x), such that F{x) > for all F E A'^, 
or, in other words, x E {A*^_)* — A^. Since x ^ 0, we have 
u{x) 0, and uj :— x/u{x) defines a state uj E i^A which 
depends on E, and will be mixed in general. 
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Set A := maxtpgn^ i?(iy9) > and F := E/X, then 
F{uj) — 0, and the set of states ip with F{ip) = 1 is a 
non-empty face of ^Ia- Let w' be any extremal point of that 
face, then it is a pure state which is by construction per- 
fectly distinguishable from uj. Hence ~ c, and so 
Ea,>{x) = u{x)Ei^i{uj) = 0. Due to it follows that there 
is some A > such that E^i = \E. We have thus shown that 
every ray-exposed effect is of the form \' E^^i for some A' > 
and pure state lu' . According to Straszewicz' Theorem ifTOl . 
the exposed rays are dense in the set of extremal rays, hence 
every ray-extremal effect is of this form. 

Now we extend (•, •) to an inner product (•, •) on all of A. 
If a;,?/ € A, use the decomposition x — Xqh^ + x with 
X G A (and similarly for y) and define {x,y) :— Xxoyo + 
(1 - X){x, y), where A -c/(l - c) e (0, 1), since c < 0. 
It is easy to check that this is an inner product, satisfying all 
statements of the theorem. We can now identify linear func- 
tionals L : A ^ Wi with vectors L E A via L{x) = {L,x). 
Every ray-extremal effect is of the form E^{(p) ~ {lu, ip) for 
some pure state uj, hence E^i = to. Thus, in this identifica- 
tion, all extremal rays of A'^ are contained in A+. Since they 
generate the full cone A*^, we have A*^ C A^. On the other 
hand, consider an extremal ray of A^; it is spanned by some 
pure state uj. By construction, (w, tp) > for all Ld,(p E Ha, 
hence the corresponding effect E^i is contained in A*^_. Thus, 
C A'^. In summary, we get A^ = A*^ under the inner 
product (•, •) - that is, A is self-dual. □ 

In low dimensions, bit-symmetric state spaces are rare. 
Using the classification of transitive state spaces in ifTTI . 
it follows that the only bit-symmetric 2-dimensional state 
spaces are the unit disc and the regular polygons with an 
odd number of vertices. In 3 dimensions, there is only 
the unit ball (representing a qubit) and the unique regular 
self-dual polytope, the tetrahedron (representing a classical 
4-level system). For a different set of postulates leading to 
self-duality, see |13|. 

Non-locality. Given two state spaces A and B, we can con- 
sider the set of all joint states (that is, correlations) on AB 
which are consistent with the no-signalling principle |2|; this 
is called the maximal tensor product A Omax B of A and B. 
Explicitly, flAB is the set of all w G A(^B with u^(^u^ (a;) — 
1 and E^ (g) E^{uj) > for all E^ e Al,E^ e B^. 

If A and B are the square state space (j2]), then A (8)max B is 
called the "no-signalling polytope". It contains so-called PR 
boxes |1| which violate the Bell-CHSH inequality by more 
than any quantum state. It has been asked why quantum the- 
ory does not allow for such "maximally non-local" states. The 
following theorem generalizes the results in ifTSll : 

Theorem 2 The maximal tensor product A <8)max B of two 
state spaces can only be bit-symmetric if it does not contain 
any entangled states at all. 

Proof. From the definition of ^ab, it follows that all extremal 
rays of the effect cone {AB)\_ are of the form i?^ ® E^ . 



If VLab is bit-symmetric, then it is self-dual; hence, all pure 
states (generating the state cone) are product states. Since all 
states are mixtures of those, they must be unentangled. □ 
If A and B are classical ua- and ris-level systems, then 
A ®max i? is a classical riyins-level system. It is bit- 
symmetric, but does not contain any entangled states. On the 
other hand, any bit-symmetric composition AB of two state 
spaces A and B which does contain entangled states (such as 
quantum theory) must be a proper subset of A (g)niax B - there 
are at least some maximally non-local states of A (X)inax B 
which AB cannot contain. 

While the omission of some states of A (g),„ax B does not 
in itself necessarily reduce the amount of non-locality in a 
theory |T4|, this result still gives a hint that bit symmetry 
might introduce constraints on the amount of Bell inequality 
violations. This conjecture is further substantiated by the 
findings in |7 |, where it was shown that a class of composites 
of regular n-gons as in Fig. [T| satisfies the Tsirelson bound if 
and only if n is odd, i.e. the theory is locally bit-symmetric. 

Conclusions. We have shown that self-duality, one of the 
defining features of quantum theory ifTSll . follows from the 
computational primitive of bit symmetry. Thus, the power 
of reversible computation (or, equivalently, time evolution) 
severely constrains the statistical behaviour of any physical 
theory. We have also proven that bit symmetry restricts the 
set of allowed bipartite states, leaving the interesting open 
problem to quantify the consequences for violations of Bell 
inequalities. 
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